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ON  TOE  CHOICE  OF  VARIANCE 
FOR  TOE  LOG  RANK  TEST. 

by 

Mark  Brown 


ABSTRACT 

The  log  rank  test  is  widely  used  for  comparison  of  survival  curves. 

This  paper  examines  various  estimators  for  the  variance  of  the  log  rank 
test  statistic.  These  include  the  Mantel-Haenszel  variance,  the  permu¬ 
tation  variance  of  Peto  and  Peto  and  several  newly  proposed  estimators. 
The  results  generalize  to  a  wide  class  of  test  statistics  for  the  two 
sample  problem  with  censored  data. 
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1.  Introduction.  The  log  rank  test  is  widely  used  for  comparison  of 


survival  curves.  There  are  two  versions  of  the  test  which  differ  in  the 
choice  of  variance  for  the  same  test  statistic  T.  The  MH (Mantel -Haenszel) 
variance,  an  estimate  of  Var  T  which  is  unbiased  independent  of  differences 
in  censoring,  and  the  permutation  variance  of  Peto  and  Peto,  for  which  the 
variance  is  computed  under  the  assumption  of  equal  censoring. 

This  paper  examines  the  choice  of  variance.  It  is  observed  that  for 
equal  sample  sixes  the  MH  variance  tends  to  underestimate  the  true  variance 
when  T  is  large  in  absolute  value.  As  this  is  the  situation  where  rejection 
of  the  null  hypothesis  is  to  be  seriously  considered,  it  appears  that  the 
true  type  1  error  will  tend  to  be.  higher  than  that  assumed,  and  the  reported 
P  value  will  tend  to  be  too  low.  Examples  with  small  sample  sixes  are  given 
to  illustrate  this  phenomena.  Further  work  is  needed  to  see  if  P  values  are 
seriously  distorted  for  larger  samples. 

In  section  4  it  is  shown  that  under  equal  sanple  sizes  the  permutation 
variance  tends  to  overestimate  the  true  variance  when  censoring  is  unequal. 

It  thus  appears  prudent  to  use  the  permutation  variance  rather  than  the  MH 
variance  in  assessing  significance. 

In  section  5  alternative  variance  estimators  are  proposed. 

The  log  rank  results  generalize  to  a  wider  class  of  test  statistics. 
This  point  is  discussed  in  section  6. 


2 


2.  Review  of  tne  log  rank  test.  Sample  1  consists  of  n  individuals  and 
sample  i  of  m  individuals.  For  eacn  we  observe  eitner  the  time  of  death 
or  censoring  wnichever  occurs  first,  and  we  know  which  of  the  two  takes 
place.  For  simplicity  we  assume  that  no  ties  occur  among  the  k  uncensored 
observations  in  the  combined  sample  which  are  labeled  as  t}  <  t2  ...  <  t^. 
define  to  be  the  number  of  patients  in  the  combined  sample  at  risk  at 
time  t7,  i.e.  those  for  whom  the  minimum  of  the  death  and  censoring  times 
are  i  t^.  Similarly  *s  tlie  number  of  patients  at  risk  from  group 

1(2)  at  define  6^  *  1  if  the  death  occuring  at  t^  is  from  sample  1,  0 

k 

if  from  sample  2.  define  N  *  £  5.,  the  number  of  uncensored  observations 

1  1 

from  sample  1.  The  hypothesis  of  interest  is  iiQ:  The  survival  distributions 
of  groups  1  and  2  are  the  same.  The  alternative  can  be  one  or  two-sided, 
the  one-sided  alternative  favoring  group  2  survival  would  be  :  The  hazard 
rate  for  group  1,  h^,  exceeds  that  for  group  2,  h2,  in  the  range  of  interest. 
The  log  rank  test  statistic  is  defined  by: 


(2.1) 


T 


K 


l 

i=l 


'li 


If  censoring  is  assumed  to  oe  independent  of  survival  then  inde¬ 
pendent  of  differences  in  censoring  for  tne  two  groups  we  have: 


(2.2) 

L  T  =  0 
0 

(2.3) 

I:  T  >  0 

li. 

The  test  of  vs  under  the  MH(Mantel-Haenszel)  approach  is  to 
reject  Uq  for  T  large:  the  distribution  of  T  under  is  approximated  by  a 
normal  distribution  with  mean  0  and  variance: 
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(2.4) 


r  RliR2i 

il 


The  NH  variance,  o^,  is  an  unbiased  estimator  of  VarH  T,  independent 


H. 


of  differences  in  censoring  between  the  two  groups. 

Peto  and  Peto  CIO]  observed  that  under  the  assuaption  of  equal  censoring 
a  test  based  on  the  log  rank  statistic,  T,  can  be  interpreted  as  a  sua  of 


scores  rank  test.  Define  the  score  of  a  patient  with  observation  as 

h  •  ‘i  - ,  l  &  “he” 


i  t 


1  if  is  uncensored . 
0  if  Z^  is  censored. 


Then  T  «  J  U.. 
group  1  c 


The  permutation  variance  of  T  is  easily  computed  as: 

K 


<2‘4>  °p  *  T^-.-iT 


K  ■  l  fr\ 

i-l  i1 


The  permutation  test  based  on  T  is  a  conditional  test  given  A,  where  A 

is  the  sample  outcome  with  group  identity  of  patients  deleted.  In  the  absence 

of  equal  censoring  neither  the  MH  test  nor  the  permutation  test  based  on  T 

are  conditional  tests  given  A.  The  conditional  mean  of  T  under  ttQ  given  A 

2 

depends  on  the  censoring  and  will  generally  not  be  zero.  Furthermore 

is  generally  not  conditionally  unbiased  for  Var„  (T|A). 

M0 
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3.  A  discussion  of  the  M-H  variance.  Suppose  that  n  ■  ».  Below  is  a 

plausibility  argument  for  the  assertion  that  the  MH  variance  tends  to 

underestimate  the  true  variance,  Varu  T,  when  T  is  large  in  absolute  value. 

n0 

A  consequence  of  this  would  be  exageratedly  small  P  values  under  the  MH 
test.  The  plausibility  argument  is  difficult  to  quantify.  Illustrative 
examples  with  small  sizes  are  given,  which  do  indicate  the  existence  of  a 
problem.  Further  work  is  needed  to  examine  how  serious  the  problem  is  for 
larger  sample  sizes. 

Consider  the  sample  points  which  lead  to  a  large  absolute  value  of  T, 
and  thus  lead  the  statistician  to  consider  rejection  of  HQ.  In  general  they 
are  characterized  by  an  excess  of  uncensored  observations  from  one  of  the 
samples.  This  leads  for  many  i  to  an  inbalance  between  Rj^  and  1^  causing 


li  2i 

”72 — 


to  be  small,  and  consequently  to  be  small. 


Large  values  of 


|t|  tend  to  be  accompanied  by  small  values  of  causing  exageratedly 
large  M/c^,  and  P  values  which  are  too  small.  The  statistician  is  observing 

Rli 

close  to  zero  or  one,  seeing  that  it  has  resulted  largely  from  an  in- 
Ki 

balance  in  deaths  rather  than  censoring,  and  then  estimating  the  variance  at 


Rli  R2i  Rli 

that  stage  by  ■  ■■  g-  ,  is  ignoring  the  fact  that  under  Hn  -s—  should  not  be 

R?  0  Ki 

as  extreme  as  in  the  current  case.  As  the  statistician  is  attempting  to 

estimate  the  variance  under  Hq,  he  is  ignoring  an  important  consideration. 

The  fact  that  in  other  cases  (|T|  small)  ignoring  similar  information  leads 

2 

to  an  overestimate,  and  the  two  balance  out  to  make  unbiased,  does  not 
alleviate  the  difficulty. 


s 


Below  are  two  cases,  n  •  m  »  3,  and  n  ■  m  -  4,  both  without 
censoring . 

2 

Table  1  below  considers  n  *  m  ■  3.  The  values  of  appear  in 

column  1.  In  column  2  the  average  value  of  |T|  is  computed  for  sample 

2  2  2 
points  with  common  a  Note  that  as  o *  increases  EL |T| |o‘  ^decreases. 
r  mh  MH  '  MH 

The  sample  correlation  coefficient  between  and  |t|  equals  -.86. 

2 

The  average  of  over  tne  20  sample  points  equals  the  permutation 

variance  1.065.  If  the  ranks  1,  2,  3  are  observed  for  sample  1  then 

the  exact  P  value  is  .05;  the  P  value  using  a  normal  approximation 

2 

with  permutation  variance  is  .0365;  the  P  value  using  o^  *  .68  as 
approximating  variance  is  .0124. 

Table  1  (n«m«3;  no  censoring) 

2  2 

o*  E(|T||e.„,)  number  of  cases 

Mn  Mil 


.68 

1.85 

2 

.yo 

1.52 

2 

.06 

1.17 

4 

1.15 

.52 

4 

1.21 

.50 

8 

Table  2  is  similar  to  table  1  with  n  ■  m  *  4.  In  this  case  the 

2 

sample  correlation  coefficient  between  oj^  and  |t|  equals  -.88,  slightly 
higher  in  absolute  value  than  for  n  ■  m  ■  3.  Tne  permutation  variance 
equals  1.51. 


Table  2  (nin«4;  no  censoring) 


.  2 
°MH 

number  of  cases 

.88 

2.54 

2 

1.06 

2.29 

2 

1.14 

2.09 

2 

1.17 

1.85 

4 

1.29 

1.95 

2 

1.37 

1.75 

2 

1.39 

1.52 

4 

1.43 

1.17 

4 

1.46 

1.17 

8 

1.54 

.95 

4 

1.62 

.75 

4 

1.64 

.54 

8 

1.68 

.50 

8 

1.71 

.50 

16 

Table  3,  again  for  n  *  m  -4  without  censoring,  lists  the  7  largest 

12  7 

values  of  T  with  corresponding  exact  P  values  jq,  ....  -jq,  and  P  values 
under  the  normal  approximations  with  both  MH  and  permutation  variance.  It 
is  seen  that  the  P  values  are  systematically  too  low  using  the  MH  variance. 


Table  3 


value  of  T 

Exact  P  value 

2 

Ojyjn  p  value 

2 

Op  P  value 

2.54 

.0143 

.0035 

.0194 

2.29 

.0286 

.0133 

*0312 

2.09 

.0429 

.0255 

.0446 

1.95 

.0571 

.0424 

.0558 

1.92 

.0714 

.0380 

.0592 

1.78 

.0857 

.0502 

.0738 

1.75 

.1000 

.0667 

.0770 

The  fact  that  in  table  3  the  permutation  variance  gives  a  better  approxi 
nation  is  not  surprising,  k'e  are  computing  the  exact  P  value  under  the  con¬ 
ditional  distribution  of  T  given  A,  under  the  assumption  of  equal  censoring. 
This  it  the  "home  court"  for  the  permutation  variance.  At  this  point  we  are 

not  arguing  the  virtue  of  the  permutation  variance,  but  are  illustrating  the 
2 

behavior  of  o.^,  described  earlier. 


Note  that  when  n  ■  m  and  censoring  is  equal  a  simple  symmetry  argument 
2 

shows  that  Cov(o^,T)  -  0.  Thi.  is  an  example  for  which  two  random  variables 

are  uncorrelated  but  highly  dependent. 

If  one  could  show  that  the  MH  variance  is  the  conditional  variance  of 

T  given  an  appropriate  function  of  the  data  for  which  the  conditional  mean 

2 

of  T  remains  zero,  then  the  above  mentioned  behavior  of  o^  would  no  longer 
be  disturbing.  The  MH  variance  would  be  the  appropriate  variance  for  a  well 
defined  conditional  experiment.  No  such  interpretation  has  been  offered 
however  and  it  is  clear  by  simple  counterexamples  that  any  such  interpretation 
would  have  to  be  asymptotic  rather  than  exact. 


r 


8 


4.  A  property  of  the  permutation  variance.  Consider  the  case  of  equal 
sample  sizes  and  random  censoring,  with  the  group  1  censoring  distribution, 
H,  allowed  to  differ  from  the  group  2  censoring  distribution  I.  Recall  that 


2  n 

°P  "  3T3HTTT 


K  i 

Hr 

i  i 


,  the  permutation  variance,  is  a  function  of  (K,R^, . . . ,RR) 


which  under  equal  censoring  is  the  conditional  variance  of  T  given  (K,R^. . . .R^) 

2  '2  ~2 

In  theorem  1  below  it  is  shown  that  ^  i  ,  where  o  is  a  function  of 
(K,R, , . . . ,R„)  depending  on  (F,H,I)  which  is  unbiased  for  Var„  T.  Thu.  for 

IK  Hq 


any  (F,H,I),  E  a*  2  E  {2)  =  Var  T. 

n0  p  0  0 


2 

In  view  of  this  property,  the  fact  that  .r  does  not  vary  with  T  for 
2  2 

fixed  (K,R1# . . .  ,Rr)  as  does,  and  the  difficulty  with  discussed  in 

section  3,  it  would  seem  prudent  under  approximately  equal  sample  sizes  to 
2  2 

use  rather  than  in  assessing  significance. 

The  following  two  lemmas  are  needed  in  the  proof  of  theorem  1. 


1 .  Let  a 


a*. 

u 

• 

N 

a  * 

• 

and  = 

e 

• 

a 

l  “j 

K 

(0,1,..., m}  with  an  *  and  — -  & 

m 

stochastically  larger  than  |3  i.e.  \ 

r 

m  a . 


|  III  Ul  M  . 

Proof.  For  any  k,  —  \  a.  •  l 

“k  k  3  j=k  “k 


m 

1*  I 

j«k*l 


l 


k+1  ai-l 


m 

I 

j»k+l 


J 

n 

k+1 


be  two  probability  distributions  on 


- - ,  i  «  1,  ...,  m.  Then  a  is 

Bi-1 

m 

o<i  2  I  6i»  r  »  0,  . . . ,  m. 


1 

Is..  Theref  ore 
ek  k  3 
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hk(f5) "  *  nr-  -  hic (a)  * so  l  -  Va-^w)  s  V(i-h^(«»-  J  ak.  || 

k6j  l°i 

Lemma  2.  Let  X  BfnjPj),  Y  'v  B(n,p2).  Then  E[X(r-X)  |X*Y»r]  is  maximized 


when  p^ 
Proof. 


■  P2»  Its  maximum  value  is 
(i)  Case  1,  r  £  n.  Define: 


(4.1) 

and 

(4.2) 


Note  that: 


(4.3) 
where 

(4.4) 


Y^c) 


n  n  ,  j  r-i.  r 

h  r-j  p  ’  ’  J  2 

f  n  )2  r/2  r 

[r/2j  c  '  3  "  T*  r  even 


YjCc) 

<>i (c)  ■  - »  i  B  0.  ....  0/2) 

l  Y.:(c) 
j=0  3 


E(X  (r-X) |X+Y»r) 


*  l  l{r-l)aAc) 
0  *■ 


c 


pjqi 

P2^2 


Since  is  increasing  in  the  range  [0,[j)3,  (4.1)  will  be  maximized 

by  that  value  of  c(if  such  a  value  exists)  for  which  the  probability  distri¬ 
bution  a(c) ,  defined  by  (4.2),  is  stochastically  largest. 

We  now  prove  that  c  ■  1  gives  the  stochastically  largest  distribution. 

By  lemma  1  it  will  suffice  to  prove  for  abritrary  positive  c  that: 


(4.5) 

(4.6) 


a0(l)  *  a0(c) 

•^(l)  ai(c) 

0^(1)  2  <ii_1(c)*  1  = 


0/2] 
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(4.7) 


The  statement  (4.5)  is  equivalent  to: 

)  hsL  *  l  U  UJ  |ckt'£ 


:o  (?) 


l  T-l 

c  +c 

Hhe  right  side  is  a  weighted  average  of  - ^ - while  the  left  side 

,  r 

uc  y  T—X 

puts  all  its  weight  on  the  value  — .  Since  the  function  g(x)  »  c  ♦  c 
is  decreasing  in  [0,r/2.)  and  g(x)  =  g(r-x),  it  follows  that  for  fixed  c  g 
is  maximized  at  x  -  0  or  equivalently  at  x  >  r.  Thus  (4.5)  is  proved. 


To  prove  (4.6)  note  that: 


(4.8) 


^(c) 


l  r-i 
c  +c 


r  odd,  i  «  1,  .... 


o.  , (c)  i  i-1  r-i-1*  .  r-2 

i-I v  1  c  +c  x  even,  i  =  1, 


(4.9) 


<^(<0 


,  r  even,  i  *  r/2 


where  X^  is  a  constant  independent  of  c.  In  both  cases  ((4.8)  and  (4.9)  a 
simple  differentiation  argument  shows  that  the  maximum  is  achieved  at  c  *  1. 

This  concludes  the  proof  of  case  1. 

(ii)  Case  2,  r  £  n.  Suppose  r  >  n.  Then  (n-X)  +  (n-Y)  ■  2n  -  r  <  n,  thus 
by  case  1,  E[(n-X)[2n-r)-(n-X) Jjx+Y=r]  is  maximized  by  choosing  Pj  *  p^.  But 
(n-X)[(2n-r)-(n-X) j  =  X(r-X)+n(n-r) ,  Thus  maximizing  E[(n-X)[(2n-r)-(n-X) ]jx*¥-r 
is  equivalent  to  maximizing  E(X(r-X) |x+Y*r) ,  and  case  2  is  proved. 


Finally  note  that  when  p^  =  p^, 


11 


Theorem  1.  Assume  n  ■  m.  Let  the  censoring  variables  for  sample  1  (2)  be 
i.i.d.  with  distribution  H(I).  Assume  that  the  censoring  variable*  are 
independent  of  the  survival  variables  and  that  the  two  groups  are  indepen¬ 
dent  of  each  other.  Then: 

(4.10)  E  a2  2  Var  T 

H0  p  H0 

Proof.  The  proof  below  is  heuristic  but  can  be  made  rigorous  using  martin¬ 
gales  .  For  j  ■  1 ,  2  define  R^ (u)  to  be  the  number  of  group  j  patients  at 
risk  at  time  u";  define  R(u)  ■  Rj(u)*R2(u).  Define  H(u)  to  be  the  history 
of  the  process  until  u”;  this  includes  all  censored  and  uncensored  observations 
and  knowledge  of  group  membership  of  the  patients  involved.  Define  N(t)  to 
be  the  number  of  uncensored  observations  in  L0,t3,  and  let  h  denote  the 
common  hazard  rate  function  for  the  two  groups.  Note  that  under  Hp,  dN(u) 
and  H( u)  are  conditionally  independent  given  R(u),  with 
E(dN(u) |H(u))  -  E(dN(u)|R(u))  «  R(u)h(u)du.  It  follows  that: 

E,.  (g(H(u))dN(u)]  -  Eh  [E„  (g(H(u))|R(i$)dN(u)3 
‘*0  0  0 

-  E^CE^j^CgCHtu))  |R(u))R(u)h(u)du3 
2  ACuJRjCu) 

Now  cu.  *  J~~ — o - dN(u)  is  unbiased  for  Varu  T.  Defining 

R2(u)  H0 

a2  m  — E(R, (u)R,(u) |R(u))dN(u) ,  and  using  (4.11)  with 
R  (u)  1 

g(H(u))  »  Rj(u)R2(u),  we  obtain; 

(4.12)  Eh  o2  .  VarH  T 

M0  M0 
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*2 

We  wish  to  compute  a  under  known  (F,H,I).  For  convenience  we  adopt 
the  convention  of  using  left  continuous  rather  than  right  continuous  cdf's. 

Thus,  for  example,  H(x)  ■  PH(X<x) ,  and  H(x)  ■  PH(X2x).  Now  Rj  (uj'tBCn.FoOHfu)) 
and  R2(u),vB(m,F(u)I(u)),  where  B(n,p)  denotes  the  binomial  distribution  with 
parameters  n  and  p.  By  lemma  2,  E^fRj  (u)R2(u)  |R(u))  is  maximized  when 
H(u)  -  I(u),  the  maximum  value  being  2('2n-l)RfrO  (R(u)-l) . 

Thus  0^  -  R(u)(R(u).1)'|dN(u)2j_lE  [R  (U)R  (u)|R(u)]dN(u)»o2. 

y  R(u)[l  J  Rz(u)  "o  1  1 

and  E„  o2  2  E  a2  =  Varu  T. 

H0  p  H0  H0 
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S.  Alternative  variance  estimators.  It  was  shown  in  section  4  that  a  is 


unbiased  for  Var.  T  and  in  the  case  of  equal  censoring  reduces  to  the  per- 

Ho 

u  _  2 

mutation  variance  o^;  under  equal  censoring  is  the  appropriate  variance 

for  a  conditional  test  given  A. 

"2 

The  estimator  a  also  works  well  in  the  following  case.  Assume  that 

all  group  1  observations  are  censored  at  s  and  all  group  2  observations  at 

times  which  exceed  s.  Define  N(s)  to  be  the  number  of  uncensored  observations 

”2 

from  the  combined  groups  in  LO.s).  Then  cr  reduces  to: 

;(s). 


(5.1) 


nm  r-  iiTiB-j. 

(n*m) (n+m-1)  l  n«-m-i<*l 


'n+m-i 


Note  that  (5.1)  is  unbiased  for  Var„  T  and  is  a  function  of  N(s).  Under 

0 

Hq  the  conditional  distribution  of  T  given  N(s)  is  independent  of  F  and 

Eu  (T|N(s))  «  0,  therefore  Varu  (TfN(s))  is  also  a  function  of  N(s)  which  is 
H0  H0 

unbiased  for  Var„  T.  Since  N(s)  is  complete  (follows  from  completeness  of 
0 

-  ? 

the  binomial  family)  it  follows  that  o  «  Var„  (T|N(s)).  Thus  once  again 

H0 

"2 

a  reduces  to  a  conditional  variance  and  is  the  appropriate  variance  for  a 
conditional  test. 

“2 

One  further  example  to  illustrate  the  use  of  a  :  Suppose  that  all 
patients  enter  the  study  at  the  same  time,  censoring  is  due  only  to  termi¬ 
nation  of  the  study,  and  the  study  terminates  at  w^,  the  time  of  the  kth 

“2 

smallest  order  statistic  from  the  combined  sample.  In  this  case  o  is 
easily  computed  to  be: 
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Since  (5.2)  is  unbiased  for  VarH  T  and  is  a  constant,  it  coincides  with 

M0 

Var,.  T.  This  implies  that  Var  T  is  independent  of  F,  a  fact  which  can 
HU  H0 

easily  be  independently  verified.  Note  that  in  the  special  cases  k  ■  n+a  and 

2 

k  >  n*m-l  (5.2)  coincides  with  a  . 

P 
-  2 

Note  that  o  can  only  be  computed  under  censoring  assumptions.  For  the 

”2 

three  specific  cases  discussed  above,  o  gave  useful  and  appropriate  variances. 

(5.3)  Case  of  known  censoring  times.  Suppose  that  all  censoring  times,  including 
those  corresponding  to  uncensored  observations  are  known.  For  example,  if  the 
data  is  analyzed  at  time  T  and  the  only  cause  of  censoring  is  termination  of 

the  study,  then  a  patient  entering  the  study  at  time  x  has  a  censoring  time 
of  T-x. 

In  this  case  we  think  of  the  censoring  times  as  fixed  numbers.  Hypothetical 

repetitions  of  the  experiment  yield  different  survival  times  but  the  same  fixed 

censoring  times.  The  variance  of  the  statistic  T  under  HQ  will  depend  on  F. 

'2 

This  variance  can  be  unbiasedly  extimated  by  a  .  To  do  so  we  need  to  compute 

Eh  (R. (u)R_(u) |R(u))  under  known  censoring  times.  Define  £.(u),  j  ■  1,  2  to  be 
n0  1  1  J 

the  number  of  group  j  patients  with  censoring  time  a  u,  and  £(u)  ■  £j(u)+£2(u). 
There  are  £(u)  potential  members  of  the  risk  set  at  time  u  all  equally  likely 
under  HQ  to  be  included.  The  conditional  distribution  of  R^(u)  given  R(u)  is 
thus  hypergeometric.  It  easily  follows  that: 

l  (u)£  (u)R(u)(R(u)-l) 

(5.4)  E  (R  (u)R  (u)|R(u)  -  — - 1 - 

M0  1  2  £(u)(£(u)-l) 

To  distinguish  the  estimator  o  in  the  case  of  known  censoring  from  pre- 

*2 

viously  considered  cases  we  will  denote  it  by  a  .  Then  from  (5.4)  and  (4.11): 


I2QZ 
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(5.5) 


“2  r  *li42i(Ri_1) 
a  •  l  - 

i-1  4i(fi- 1)^ 


(5.6)  E„  o2  -  Varu  T 

H0  u 

where  ^  -  -Mtj),  j  *  1,  2  and  ^  +  l2i. 

Note  that  when  ^  *  Rji»  ^i  *  R2i*  i  *  1#  •••»  K»  then  °2  *  q2mh* 

*2  2 

Thus  o  and  o  ^  will  be  close  together  when  most  of  the  uncensored  obser¬ 
vations  occur  very  shortly  before  their  censoring  times.  Also  note  that 

*2  2 

when  ■  n,  f2i  «  m  then  o  ■  op.  The  reason  for  this  is  that  under  equal 
censoring  all  n+m  patients  have  the  same  probability  under  HQ  of  being  in  the 
risk  set  at  time  u. 

(5.6)  Remark.  The  exact  variance  in  the  case  of  known  censoring  times  is 


computed  by: 


Varu  T  - 
H0 


1  u)^(u)-l)  H0 


^.(u)f2(u) 

/  - — - —  i 

£(u)0£(u)-l) 


l-(F(u) 

F"(u) 


*(«) 


dF(u). 


(5.7)  Remark.  It  is  undesirable  on  philosophical  grounds  to  have  the 
statistical  procedure  depend  on  potential  but  unrealized  censoring.  Pratt 
[2]  has  interesting  comments  on  this  point  and  Cox  [3]  and  Efron  (5)  briefly 
mention  it  as  well. 

"2  2  2 
The  estimator  a  depends  on  unrealized  censoring  times  while  o^, 

*2 

and  o  (defined  below)  do  not.  Nevertheless,  when  all  censoring  times  are 
*2 

known  I  would  use  a  .  I  would  not  throw  out  part  of  the  information  just  to 
avoid  the  above  philosophical  objection. 
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It  is  clear  that  the  values  of  the  unrealized  censoring  tines  do  not 

provide  intrinsic  information  about  survival,  the  attribute  of  interest. 

However  they  do  provide  information  about  the  distribution  of  the  test 

statistic  under  hypothetical  repititions  of  the  experiment.  In  a  frequent is t 

approach  such  information  is  often  useful.  To  a  Bayesian  or  follower  of  the 

liklihood  principle  it  is  irrelevant. 

“2 

(5.8)  Remark.  The  rational  for  a  is  as  follows.  The  problem  of  estimating 

Var„  T  can  be  reduced  in  an  appropriate  sense  to  that  of  estimating 
n0 

R1 (u)  2 

y(u)  =  E  [ (dN. (u)  - dN(u))  |dN(u)),  where  R. ,  R,  N  have  previously  been 

H0  1  R(u)  1 


defined  and  N^t)  is  the  number  of  group  1  uncensored  observations  in  [0,t]. 
The  M  H  estimator  estimates  y(u)  by 


Wu) 


EJh 


(u)  - 


Rj(u) 


R(u)J 


|R1(u),R2(u),dN(u)  ;  o  estimates  y(u)  by 


T(»)  *  b^{Tmi(u)  |*(U)) 


In  comparing  y(u)  to  y  (u)  we  see  that  Var,  (y(u))  £  Var  (y  ,H(u))  while 

•'ll*  iIq  l'in 

CovH  (y(u).y(v))  =  CwH0*Y«l^u)»Y(iBifu));  it  thus  follows  that  VarH  o2  S  Var^o^;. 

2 

To  interpret  this  result  properly  recall  that  o^  is  computable  from  the  data 

"2 

without  censoring  assumptions  while  a  requires  special  assumptions.  When 

“2  "2  2 
a  is  computed  under  specific  censoring  assumptions  then  Var^  a  s  VarH  °  MH 


under  these  assumptions,  but  not  necessarily  more  generally. 

*2 

While  reduction  in  variance  is  welcome  my  main  reason  for  employing  o  is 

2 

that  I  believe  it  will  work  better  than  when  |T|  is  large.  The  difficulty 


in  estimating  y(u)  when  |T|  is  large  is  that  we  need  an  estimate  of  variance 


under  HQ  from  data  which  is  extreme  under  llQ .  We  do  not  want  to  use  the 
observed  variance  Y^C11)  which  we  know  is  untypical  of  Hq.  The  estimator 
y(u)  removes  the  dependence  of  the  variance  estimator  on  R^(u)/R(u)  and 
attempts  to  replace  the  observed  variance  Y^Cu)  by  a  quantity  which  better 
estimates  its  expection,  y(u)< 

The  computation  of  y(u)  is  simple  under  assumptions  which  imply  that 
there  are  a  pool  of  patients  who  are  potential  members  of  the  risk  set  at 
u,  and  the  R(u)  patients  who  are  actually  in  the  risk  set  are  chosen  from 
the  pool  by  random  sampling  without  replacement.  This  structure  characterizes 
the  examples  we  have  so  far  studied. 

(S.9)  Random  Censoring.  If  we  assume  random  censoring  with  distribution  H 

for  group  1  and  I  for  group  2,  then  the  variance  of  T  depends  on  (F,H,I)  in 

“2 

a  complex  way.  The  unbiased  estimator  o  also  depends  on  (F,H,I),  specifically; 


(5.10) 


EH^(R1(u)R2(u)|R(u)-r) 


where ; 

(5.11)  o  *  min(0,m-r),  5  »  max(r,n),  c(u)  ■ 

I(u)(1-F(u)IT(u)) 

"2 

Rather  than  try  to  estimate  o  we  opt  for  a  different  approach.  The 
*2 

estimator  o  requires  knowledge  of  1 2i,  i  -  1,  ....  k.  We  no  longer  know 

the  censoring  times  corresponding  to  uncensored  observations.  However, 

defining  N,  ,  j  •  1,  2  as  the  number  of  group  j  observations  among  the  first 
J  »* 

i  uncensored  observations,  we  know  that  R.  .  .  s  l..  s  R,.  ♦  N.  .  .,  j  »  1,  2. 

1  .l-I  11  11  J  *1*1 
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This  suggests  estimating  by: 


(5.12) 


H(t.) 

■  Rii  *  l  «i 

J  ii(t  ) 


.th 


where  6^  *  1  or  0  depending  on  whether  the  j  uncensored  observation  is 
from  group  1  or  2,  and  H  is  the  Kaplan -Meier  estimator  of  the  censoring 
distribution  for  group  1. 

By  Efron's  (LS3)  interesting  observation  of  the  self-consistency  property 
of  the  Kaplan-Meier  estimator: 


(5.13) 


ln  -  nH(t^) 


(5.14) 


l2i  »  ml(ti) 


We  now  define  an  estimator  a  by  substituting  1 2i,  ♦  L 2i 

“2 


for  an<*  °  (5*5)  obtaining: 

*2  £ 


(5.15) 


I  ^ 


i-1  i.Ui-l)Ri 


The  estimator  o  performs  well  in  the  following  cases.  If  group  1 

observations  are  censored  at  s,  group  2  observations  at  times  which  exceed  s 

*2  th 

then  o  coincides  with  (5.1).  If  censoring  occurs  at  w^,  the  k  smallest 

*2 

order  statistic  in  the  combined  sample,  then  o  coincides  with  (5.2).  In 

*2  2 

the  special  case  k  ■  m+n  (corresponding  to  no  censoring)  o  equals  o^.  If 

*  »  *2 
H  and  I  are  approximately  equal,  indicative  of  equal  censoring,  then  a  will 

2 

be  close  to  o  . 

P 
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*2  *2 

The  estimator  a  is  an  estimate  of  a  which  in  turn  appears  to  be 

2  *2 
superior  to  o^  (see  remark  (5.9).  It  is  hoped  that  o  will  generally  be 

better  than  o2^,  particularly  when  |T|  is  large. 

(5.16)  Remark.  Another  expression  for  Var,,  T  under  random  censoring  is: 


(5.17) 


Varu  T  «  nm  f  H(u)T(u)F2(u)E  \  *M-Ih(u)du 

Ho  [b+b'+2J 


where  B  *v  B(n-l,H(u)F(u)) ,  B1  'v  B(m-l,I(u)F(u))  with  B  and  B'  independent  of 

—  A  A  — 2 

each  other  and  of  R(u).  Estimating  nli(u)  by  l^(u),  ml(u)  by  f2(u),  F  (u)  by 


f(u) (f(u)-l) 


h(u)  by  ,  and  approximating  E  0y  i  in  (5,17)  yields 

R(u)  JB+B  *■*•2  J 


the  estimator  o  (5.15). 

*2  2 

(5.18)  Remark.  Both  o  and  throw  out  observations  which  are  censored 

pxior  to  t1#  that  is  they  treat  the  data  as  if  n  *  ,  m  *  R21  and  no  un¬ 

censored  observations  occur  prior  to  t^ .  This  is  quite  reasonable  and 
desirable.  In  fact  attention  should  be  focussed  on  Var^  (TlR^.R^)  rather 

2  *2 

that  Varu  T,  and  a  and  o  should  be  adjusted  accordingly. 

H0  p 

(5.19)  One  more  censoring  situation.  Termination  of  the  study  occurs  at 

time  Tjfor  group  1  and  T2  for  group  2.  Termination  is  one  cause  of  censoring; 
other  causes  are  present  such  as  withdrawal  from  the  study,  loss  to  follow>-up, 

etc.  A  group  1  patient  entering  the  study  at  time  x  has  a  maximum  censoring 

time  of  Tj  -  x  but  may  be  censored  sooner;  similarly  for  group  2.  Define 

A 

Qj,  i  ■  1,  2  to  be  the  Kaplan-Meier  estimate  of  the  group  i  distribution 
of  censoring  due  to  causes  other  than  termination  of  the  study.  In  estimating 


Qi  the  observations  corresponding  to  death  and  censoring  due  to  termination 
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of  the  study  are  treated  as  censored ,  those  corresponding  to  other  causes 
of  censoring  are  treated  as  uncensored.  Define: 


(S.20) 


(5,21) 


(t),  t  s  T1  -  s 
0,  t  >  Tj  -  s 


(t) 


Q2(t),  t  S  T2  -  s 
0,  t  >  T2  -  s 


Let  denote  the  entrance  time  to  the  study  of  the  patient  with  the 
uncensored  observation  at  .  Finally  5.  a  1  or  0  depending  on  whether  the 
patient  with  the  uncensored  observation  at  t^  was  from  group  1  (<5*1)  or 
2  (6«0).  Estimate  1 2i  by: 


(5.22) 


.  i-1  ^l.x.^i^ 

^li  *  Rli  *  2  £j  ft  . 

Ql,x.(V 


i-1 


(5.23) 


Q2,x. (V 


*2i-R2i+  U'-tpr-1 


(t,) 


a  A  > 

The  values  £2^  are  then  substituted  into  the  expression  for  c  (5,5). 
In  the  above  and  Q2  follow  the  earlier  mentioned  convention  of  left 
rather  than  right  continuous  cdfs. 
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6.  Comments  and  Additions. 


(6.1)  Consider  the  class  of  test  statistics: 


(6.2) 


T„  -  l  gov 

g  l*  1  1 


R,  . 

1  K. 

1 


The  function  g  is  chosen  to  achieve  good  local  power  against  specific 

types  of  alternatives.  The  choice  g(R)  =  R  corresponds  to  Gehan's  16] 

2  2 

generalization  of  tne  Wilcoxon  test.  Our  remarks  concerning  a  and  o^, 

found  in  sections  3  and  4,  carry  over  to  T  .  Tne  estimators  and 

*2 

a  are  now  defined  by: 


(6.3) 


;2  «  y  2  % 

g  L  -  g  (R  ) 

A  1  0  /  0  1  Ml  A 


*.(£.-l)R. 


*2  r  "^l  i^2i  2 

(0.4)  o  2  -  E  .‘V*  1 -  gV.) 

8  i.l 

For  tne  case  g(r)  =  /¥  use  of  (4. Id)  gives  tne  following  expression 


for  the  variance  of  T^  under  known  censoring  times: 


(6.5) 


/£1(u)^2(u)F(u)dF(u) 


Taking  expectations  in  (6.4)  gives  the  variance  under  random  censoring 

(6.6)  nmJti(u)I(u)F(u)dF(u) 

For  the  case  g(r)  =  r  tne  variance  under  known  censoring  times  is 
given  by: 


(6.7) 


j£x  (u)  l2  (u)  F  (u)  L  (£(u)  -2)F  (u)  *2  ]dF  (u) 
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The  variance  under  random  censoring  equals: 

(t>.8)  iunjh  T  F  (n-l)F  h+(m-l)F  T* 2jdF 

working  from  (o.S)  and  (o.7)  by  estimating  tnese  expressions  yield 

*2 

estimators  which  are  practically  tne  same  as  o^  . 

(6.d)  The  MH  approach  as  extended  by  Mantel  [8J  combines  a  class  of  dependent 
chi-square  tables  by  summing  the  variances  of  appropriately  defined  uncorre¬ 
lated  random  variables  across  the  several  tables.  1  believe  that  the  same 
behavior  of  the  variance  estimator  discussed  in  section  3  occurs  more  generally. 
For  a  simple  example  suppose  that  «  n  and  the  conditional  distribution  of 
Na  given  Ni_1  is  BCN^.p),  i  =  1,  2,  ...  . 

We  wish  to  test  p  =  pQ  vs  p  <  p^.  The  appropriate  version  of  the  Mil 
statistic  would  be: 

l(Ni"Ni-lP0)  Mq  -n 

(6-10)  T  *  - - - - *  — - 

where  M  «  I  Ho  °e  reiected  for  T  negative  and  large  in  absolute 

value  equivalently  for  -T  large  and  positive.  But  the  numerator  of  -T, 
n-MqQ,  is  large  when  M  is  small,  in  which  case  the  denominator  ^MPq^q  is 
small.  In  this  case  n-MqQ  and  HpQq0  have  correlation  -1. 

The  true  distribution  of  il  is  negative  binomial  with  parameters  n  and  qQ: 

(6.u)  ?r(«-r)  -  (;;i)p;-rq;,  r  * » 

The  variance  of  Mq^  -  n  under  equals  npQ;  its  unbiased  MH  variance  estimator 
is  Mp0q0. 

The  distortion  of  P  values  in  this  case  is  serious  even  for  fairly  large 
values  of  n. 

The  MH  procedure  is  clever  and  useful,  but  should  be  used  with  care. 


fin  n'fc 
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